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ABSTRACT 

This  paper  presents  the  negative  cate  in  a  debate  on  the 
question:  "Resolved;  Information  retrieval  is  now  an  es¬ 
tablished  scientific  discipline  with  well-defined  goals, 
methods,  and  evaluation  techniques."  Ihe  these  of  the  nega¬ 
tive  argument  is  that  information  retrieval  is  not  a  science. 
Rather,  it  is  an  applied  field,  which  deals  with  pressing, 
practical  problems.  Its  w'orkers  are  primarily  concerned  with 
achieving  satisfactory  working  solutions,  and  only  secondarily 
concerned  with  two  factors  that  characterize  research  in  true 
scientific  disciplines--the  basic  understanding  of  some  aspect 
of  nature  and  the  development  of  theory.  Some  philosophical 
definitions  of  science  are  quoted,  and  a  comparison  is  made 
between  IR  and  what  is  clearly  an  applied  field,  the  desalini¬ 
zation  of  sea  water.  The  paper  concludes  that  although  IR  Is 
not  yet  a  scientific  discipline,  workers  in  the  field  have  at 
least  achieved  the  level  at  which  they  can  recognize  the  basic 
scientific  problems  underlying  IR. 
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IS  INFORMATION  RBTRI EVAL  NOW  AN 
ESTABLISHED  SCIENTIFIC  DISCIPLINE? 

%  purpose  is  to  present  the  negative  case  i;i  a  debate  on  the  following 
question:  "Resolved;  Information  retrieval  is  now-  an  established  scientific 
discipline  with  well-defined  goals,  methods,  and  evaluation  techniques." 

The  crux  of  our  debate  is  the  definition  of  the  words  "scientific  discipline." 

1  take  this  phrase  to  be  intended  to  imply  a  context  of  well-defined  goals, 
methods,  and  evaluation  techniques  which  have,  as  part  of  their  "well-definedness, 
the  kind  of  cohesiveness,  coherence,  and  approach  toward  basic  understanding 
that  characterize  a  science.  It  is  ay  contention  that  IF  is  not  a  scientific 
discipline  because  it  is  something  else:  namely,  it  is  an  applied  field  whose 
essence  is  the  attempt  to  solve  an  important,  pressing,  practical  problem  by 
whatever  means  are  available  or  can  be  developed.  Indeed,  the  eclecticism  of 
IR  is  one  of  its  prime  characteristics,  and  one  which  helps  to  distinguish  it 
from  a  scientific  discipline.  It  might  seem  more  akin,  perhaps,  to  an  engineering 
discipline,  but  I  shall  later  suggest  an  example  that  I  think  will  persuade 
you  that  IF  is  not  even  a  discipline  within  the  fields  of  engineering. 

What  are  the  characteristics  of  a  science  or  a  scientific  discipline?  I 
cannot  give  a  lecture  here  on  the  philosophy  of  science,  but  I  think  it  is 
important  to  consider  a  few  things  about  what  a  science  is,  about  what  a  science 
is  not,  and  about  what  is  not  a  science, 

Braitbwaite  [l]  says:  "Hie  function  of  a  science... is  to  establish  general 
laws  covering  the  behavior  of  the  empirical  events  or  objects  with  which  the 
science  in  question  is  concerned,  and  thereby  to  enable  us  to  connect  together 
our  knowledge  of  the  separately  known  events,  and  to  make  reliable  predictions 
of  events  as  yet  unknown." 

Ackoff  [2],  even  more  pertinently  to  the  point  at  issue  in  our  debate,  says: 

"We  can  consider  science  as  a  process  of  inquiry;  that  is,  as  a  procedure  for 
(a)  answering  questions,  (b)  solving  problems,  and  (c)  developing  more  ef¬ 
fective  procedures  for  answering  questions  and  solving  problems,"  Bfy  con¬ 
trasting  question-answering  and  problem-solving,  Ackoff  is  leadir^j  up  to  the 
point  of  my  argument.  He  goes  on  to  say:  "Not  all  inquiry  is  scientific. 

There  is  a  large  class  of  nonscientific  inquiry,  including  what  we  call  'common 
sense'  inquiry. ...  It  has  been  proposed  in  the  past  that  common-sense. . -inquiry 
is  concerned  with  more  immediate  and  practical  problems  than  is  science.  It 
has  become  increasingly  clear,  however,  that  science,  particularly  'applied 
science,'  does  deal  with  Immediate  and  pressing  problems." 

lb'1  distinction,  then,  between  science--in  particular,  applied  seience--and 
the  common- sense  kind  of  problem-solving  to  which  I  maintain  IR  belongs,  lies 
not  so  much  in  the  practicality  of  the  problems  as  in  (so  I  am  arguing)  the 
highly  eclectic  nature  of  the  solutions  achieved  or  attempted.  But  this  is 
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not  the  only  distinction,  I  should  Hie  to  try  to  bring  out  at  least  one  more 
kind  of  distinction  by  referring  once  again  to  a  philosopher  of  science, 

Meblberg  [3]  characterizes  a  science  as  a  collection  of  data,  theories,  and 
validation  methods.  He  defines  a  theory  as  a  coherent  and  cohesive  set  of 
laws.  Laws  must  be  pertinent  and  unambiguous  answers  to  the  question  under 
consideration.  Ihe  laws  must  be  supported  by  the  available  data,  which  must 
be  sufficiently  reliable,  numerous,  and  varied  to  furnish  adequate  support. 

A  theory  Bhould  consist,  says  Meblberg,  of  "relatively  few  explicitly  formu¬ 
lated  statements  from  which  a  theoretically  Infinite  set  of  propositions  can 
be  derived  by  a  specifiable  logical  procedure." 

But  according  to  Mehlberg  a  theory  must  consist  of  answers  to  questions,  and 
these  answers  must  be  supported  by  data.  If  a  theory  is  to  contain  a  theo¬ 
retically  infinite  set  of  propositions,  how  can  they  all  be  supported  by  the 
finite  set  of  data  offered  in  the  presentation  of  the  theory?  The  way  out  of 
this  transfinlte  difficulty,  according  to  Mehlberg,  is  that  the  scientist  offers, 
along  with  his  theory  aixi  data,  a  description  of  the  methods  he  has  used  in 
arriving  at  bis  answers- -methods  which  can  then  serve  as  validation  procedures 
for  testing  any  desired  one  cf  the  theoretically  infinite  set  of  propositions 
that  are  logically  derivable  from  the  theory.  Meblberg  holds  that  these  vali¬ 
dation  methods  "form  the  gist  of  the  whole  method  of  Bcience"  (a  strong  statement 
indeed,  which  he  devotes  most  of  his  book  to  justifying).  If  a  validation 
method  turns  out  also  to  have  predictive  power,  that  is,  to  answer  questions 
besides  those  it  was  designed  to  answer,  then  so  much  the  better. 

ihus  Braithveite,  Aekoff,  and  Mehlberg  agree  that  a  science  must  have  at  least 
the  following  characteristics: 

a)  it  must  provide  answers  to  questions; 

b)  the  answers  oust  be  consistent  with  empirical  data; 

c)  there  must  be  a  unifying  structure  that  incorporates  the  answers  and 
makes  it  possible  to  extend  them;  and 

d)  predictions  should  be  possible,  and  if  they  exist,  oust  be  testable. 

1  have  cited  these  philosophers  of  science  to  add  the  weight  of  authority- -for 
whatever  that  may  be  worth--to  my  claim  that  IR  is  not  a  science  but  instead 
is  an  area  of  application  of  eclectic  ccmaon  sense:  it  is,  at  worst,  "a  bag 
of  tricks,"  but  better--and  I  think  more  accurately--lt  is  a  set  of  useful 
techniques  for  achieving  a  practical  goal.  I  think  IF  research  can  be  viewed 
as  having  both  an  intermediate  suhgcal  and  a  final  goal.  The  intermediate 
subgoal  is  that  of  improving  methods  of  document  retrieval;  the  final  goal  is 
to  achieve  what  is  called— with  a  nebulousness  that  is,  at  least  for  the  present, 
unavoidable- - "fact  retrieval . " 
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It  Is  certainly  true  that  achieving  these  two  goals  will  involve  progress  in 
accse  indisputably  scientific  disciplines.  Chief  among  these,  I  think, will  be 
progress  in  achieving  an  adequate  understanding  of  the  process  of  verbal  com¬ 
munication*  Progreso  here  will  surely  Involve  the  disciplines  of  linguistics 
and  mathematics,  and  say  involve  other  disciplines  before  the  job  is  completed. 

<3ne  might  also  expect  that  progress  in  the  "hardware"  areas  of  science  will 
sooner  or  later  be  applicable  to  same  of  the  equipment  problems  of  IB;  for  ex¬ 
ample,  cryogenics  may  cose  to  have  an  application  here.  But  I  think  it  is  also 
certainly  true  that  IR-related  work  in  such  fields  has  so  far  been  carried  out 
mostly  by  persons  who  do  not  consider  themselves  to  be  information  retrievalists. 
Bather,  most  people  in  IB  have  come  from  either  the  computer  field  or  from  the 
library  field.  They  have  come  into  IB  in  order  to  solve  immediate,  pressing, 
practical  problems.  Their  primary  motivation  is  to  achieve  certain  practical 
goals  rather  than  to  develop  theories  of  IB  or  to  create  a  logical  entity  having 
the  characteristics  of  question-answering.,  consistency  with  empirical  data,  a 
unifying  structure,  and  testable  predictions --the  characteristics  U,«t  Braithwaite, 
Ackoff,  and  Mehlberg  agree  on  in  defining  a  science.  The  distinction  between  a 
scientific  discipline  and  a  problem- solving  effort  is  illustrated  here  by  the 
distinction  between  the  attempts  to  gain  basic  understanding  of  how  verbal  com¬ 
munication  is  accomplished,  and  the  attempts  to  build  this  or  that  document  re¬ 
trieval,  sentence  retrieval,  or  formatted -fact  retrieval  system. 

At  the  risk  of  being  tedious,  let  me  repeat  that  the  distinction  lies  in  whether 
the  goal  is  primarily  a  pragmatic  one — to  achieve  a  functioning,  man-machine 
system  that  satisfies  same  practical,  pressing  need — or  whether  the  goal  is  to 
achieve  an  ever  deepening  understanding  of  some  facts  of  nature  by  creating  or 
extending  a  structure  of  intimately  interrelated  fact  and  theory.  Only  the 
latter  goal  can  justify  the  label  of  "scientific  discipline." 

As  a  further  illustration  of  the  distinction,  let  us  consider  another  area, 
which  I  am  sure  no  one  would  call  a  scientific  discipline,  and  which  I  think 
is  not  even  an  engineering  discipline  in  the  sense  that,  say,  hydraulic  engi¬ 
neering  is  a  discipline.  I  have  in  mind  the  problem  of  the  desalinization  of 
sea  water. 

The  desalinization  problem  shares  several  characteristics  with  IR.  It  is  a 
pressing  problem;  it  needs  practical  solutions;  the  cost  of  possible  solutions 
is  a  very  important  factor  in  Judging  them;  and  techniques  of  many  kinds,  from 
many  fields,  are  being  tried  out  as  possible  solutions.  We  can  get  an  idea  cf 
the  variety  of  techniques  by  excerpting  from  the  table  of  contents  of  a  recent 
report  [hi  from  the  Department  of  the  Interior  on  techniques  under  study  in 
1962  as  possible  means  of  desalinization: 

Distillation  Processes 

Multistage  flash 

Vapor  reheat  flash  and  liquid-liquid  heat  exchange 

Thin  film  vapor  compression 

Wiped  film- -fluted  tube  thin  film 
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Fluidi zed  bed 

Abeorptlon~-saulti8tage  flash 

Heat  pump 

nunldification  Processes 
Hbltiple -effect  humidification 
Solar  stills 
Diffusion  still 

Membrane  Processes 
Electrodialysis 
Reverse  osmosis 
Vapor  gap  osmosis 

Ion  Exchange  Processes 

Freezing  Processes 

Hydrate  Processes 

Solvent  Extraction  Process 

Other  applied  research  [including  algae,  high-frequency  sound,  and  high- 
frequency  electric  currents] 

Gaspare,  if  you  vd.ll,  this  long  list  of  techniques  that  hopefully  may  prove 
useful  in  deaalinization  with  the  many  and  varied  techniques  that  have  been 
used  or  tried  in  IR.  (I  shall  refrain  here  from  listing  any  IB  techniques, 
for  twu  reasons:  first,  anyone  in  the  field  is  well  aware  that  the  techniques 
have  been  many  and  varied;  and  second,  it  would  be  unfair,  to  persons  who  have 
associated  themselves  with  any  of  the  techniques  I  might  give  as  examples,  to 
single  out  any  techniques  for  mention  in  the  present  context.  )  Such  a  com¬ 
parison  will  confirm,  I  think,  the  eclectically  pragmatic  orientation  of  most 
current  work  in  IR. 

All  this  is  not  to  say  that  IR  cannot  become  a  scientific  discipline.  On  the 
contrary,  I  think  it  can  and  will.  There  is,  after  all,  a  truly  scientific 
problem  underlying  IR,  Indeed,  it  is  the  enormous  difficulty  of  this  problem 
that,  combined  with  the  great  pressures  for  at  least  symptomatic  solutions,  has 
thus  far  kept  IR  from  becaaing  a  scientific  discipline.  This  basic  problem  is 
that  of  the  nature  of  information,  especially  information  conveyed  in  the  fora 
of  natural  language.  It  is  one  thing  to  transmit  information  from  a  sensor  to 
a  reactor,  as,  for  example,  when  a  thermocouple  senses  that  the  water  in  a 
heater  has  reached  a  temperature  of  180°  F.  and  sends  out  a  signal  that  shuts 
off  the  fuel  supply.  It  is  another  thing  entirely  when  I  utter  the  sentence, 
"What  is  the  maximum  safe  working  temperature  of  this  water  heater?"  We  do  not 
yet  have  the  tools  to  understand  the  ccoplex  af  difficulties  that  such  a  sentence 
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obtrudes  against  attempts  to  specify  a  procedure  for  analyzing  it  rigorously. 

But  we  have  at  least  cosae  to  recognize  vh&t  such  tools  will  require:  viz,f 
as  understanding  of  the  intricate  relationships  among  the  information  user, 
the  written  or  eposes  word.  and  the  real  world.  As  we  progress  toward  achieving 
that  understanding*  IB  can  become  a  scientific  discipline. 
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